Search for "html::treebuilder::xpath"

HTML::TreeBuilder::XPath - add XPath support to HTML::TreeBuilder

23 ++

This module adds typical XPath methods to HTML::TreeBuilder, to make it easy to query a document....

MIROD /HTML-TreeBuilder-XPath-0.14 - 20 Sep 2011 01:46:15 UTC - Search in distribution

lib/HTML/Robot/Scrapper/Parser/HTML/TreeBuilder/XPath.pm

HERNAN /HTML-Robot-Scrapper-0.11 - 31 Oct 2013 12:12:41 UTC - Search in distribution

HTML::Robot::Scrapper - Your robot to parse webpages

WWW::GoKGS::LibXML - HTML::TreeBuilder::LibXML-based WWW::GoKGS

This class inherits all methods from WWW::GoKGS. Unlike "WWW::GoKGS", this class uses HTML::TreeBuilder::LibXML instead of HTML::TreeBuilder::XPath to parse HTML documents. Make sure to install the alternative module in addition to this module....

ANAZAWA /WWW-GoKGS-0.21 - 21 Aug 2014 02:27:48 UTC - Search in distribution

WWW::GoKGS::Scraper - Abstract base class for KGS scrapers

Class::XPath - adds xpath matching to object trees

This module adds XPath-style matching to your object trees. This means that you can find nodes using an XPath-esque query with "match()" from anywhere in the tree. Also, the "xpath()" method returns a unique path to a given node which can be used as ...

SAMTREGAR /Class-XPath-1.4 - 29 Feb 2004 23:01:16 UTC - Search in distribution

Web::Scraper - Web Scraping Toolkit using HTML and CSS Selectors or XPath expressions

42 ++

Web::Scraper is a web scraper toolkit, inspired by Ruby's equivalent Scrapi. It provides a DSL-ish interface for traversing HTML documents and returning a neatly arranged Perl data structure. The *scraper* and *process* blocks provide a method to def...

MIYAGAWA /Web-Scraper-0.38 - 20 Oct 2014 00:27:05 UTC - Search in distribution

Web::Scraper::LibXML - Drop-in replacement for Web::Scraper to use LibXML

HTML::TreeBuilder::LibXML - HTML::TreeBuilder and XPath compatible interface with libxml

8 ++

HTML::TreeBuilder::XPath can be slow for a large document. HTML::TreeBuilder::LibXML is a libxml-based, drop-in replacement for HTML::TreeBuilder::XPath. This module doesn't implement all of HTML::Tr...

TOKUHIROM /HTML-TreeBuilder-LibXML-0.26 - 19 Oct 2016 15:08:57 UTC - Search in distribution

HTML::TreeBuilder::LibXML::Node - HTML::Element compatible API for HTML::TreeBuilder::LibXML

xml_grep - grep XML files looking for specific elements

62 ++

xml_grep does a grep on XML files. Instead of regular expressions, it uses XPath expressions (in fact, the subset of XPath supported by XML::Twig). The results can be the names of the files or the XML elements containing matching elements....

MIROD /XML-Twig-3.52 - 23 Nov 2016 17:21:16 UTC - Search in distribution

XML::Twig - A perl module for processing huge XML documents in tree mode.

WWW::Ruten - Scripting www.ruten.com.tw

GUGOD /WWW-Ruten-0.03 - 30 Aug 2011 11:49:40 UTC - Search in distribution

Web::Query - Yet another scraping library like jQuery

30 ++

Web::Query is yet another scraping framework, with a jQuery-like interface. Yes, I know Ingy's pQuery. But it's just alpha quality. It doesn't work. Web::Query is built on top of the CPAN modules HTML::TreeBuilder::XPath, LWP::UserAgent, and HTML::Se...

YANICK /Web-Query-1.01 - 16 Jan 2024 20:28:14 UTC - Search in distribution

Web::Query::LibXML - fast, drop-in replacement for Web::Query

HTML::Tree::AboutTrees - article on tree-shaped data structures in Perl

37 ++

The following article by Sean M. Burke first appeared in *The Perl Journal* #18 and is copyright 2000 The Perl Journal. It appears courtesy of Jon Orwant and The Perl Journal. This document may be distributed under the same terms as Perl itself....

KENTNL /HTML-Tree-5.07 - 31 Aug 2017 08:53:16 UTC - Search in distribution

Task::BeLike::LESPEA - Modules that LESPEA uses on a daily basis

LESPEA /Task-BeLike-LESPEA-2.005000 - 12 Mar 2014 14:47:57 UTC - Search in distribution

HTML::Linear - represent HTML::Tree as a flat list

4 ++

SYP /HTML-Untemplate-0.019 - 23 Jun 2014 08:41:42 UTC - Search in distribution

HTML::Untemplate - web scraping assistant
HTML::Linear::Path - represent paths inside HTML::Tree
HTML::Linear::Element - represent elements to populate HTML::Linear

XML::Lenient - extracts strings from HTML, XML and similarly tagged text.

XML::Lenient is meant to parse markup languages such as HTML and XML in the knowledge that someone, somewhere, is going to break every rule in the book. It will handle malformed XML, wrongly nested HTML tags and everything else that I have throw...

DAVIES /XML-Lenient-1.0.1 - 15 Nov 2016 13:27:29 UTC - Search in distribution

Task::BeLike::TOKUHIROM - modules I use

This Task installs modules that I need to work with. They are listed in this distribution's cpanfile....

TOKUHIROM /Task-BeLike-TOKUHIROM-0.02 - 20 Mar 2014 01:35:41 UTC - Search in distribution

WWW::Scraper::Lite

RPETTETT /WWW-Scraper-Lite-15 - 02 Jun 2011 21:47:25 UTC - Search in distribution

WWW::Tabela::Fipe - Download the complete FIPE table and stay up to date

This module downloads the updated FIPE table for motorcycles, trucks, and cars directly from the FIPE site. Data source: fipe.org.br...

HERNAN /WWW-Tabela-Fipe-0.002 - 31 Oct 2013 12:12:52 UTC - Search in distribution

HTML::Encapsulate - rewrites an HTML page as a self-contained set of files

The main motivation for this module is for archiving and printing web pages: these typically come in various separate pieces and aren't simple to download as one chunk. However, it is possible to preserve the content of a web page, but to rewrite the...

NPW /HTML-Encapsulate-v0.3.0 - 13 Nov 2015 11:59:12 UTC - Search in distribution

XML::LibXML::jQuery - Fast, jQuery-like DOM manipulation over XML::LibXML

2 ++

XML::LibXML::jQuery is a jQuery-like DOM manipulation module built on top of XML::LibXML for speed. The goal is to be as fast as possible, and as compatible as possible with the JavaScript version of jQuery. Unlike similar modules, web fetching funct...

CAFEGRATZ /XML-LibXML-jQuery-0.08 - 23 Jul 2016 17:53:08 UTC - Search in distribution

Text::Corpus::Summaries::Wikipedia - Creates corpora for summarization testing.

"Text::Corpus::Summaries::Wikipedia" creates corpora for single document summarization testing using the featured articles of various Wikipedias. A criterion for an article in a Wikipedia to be *featured* is that it have a well written lead section, ...

KUBINA /Text-Corpus-Summaries-Wikipedia-0.22 - 25 Feb 2013 12:52:59 UTC - Search in distribution

XML::XPathEngine - a re-usable XPath engine for DOM-like trees

4 ++

This module provides an XPath engine that can be re-used by other modules/classes that implement trees. In order to use the XPath engine, nodes in the user module need to mimic DOM nodes. The degree of similitude between the user tree and a DOM dict...

MIROD /XML-XPathEngine-0.14 - 17 May 2013 02:49:03 UTC - Search in distribution

Search results for "html::treebuilder::xpath"

HTML::TreeBuilder::XPath - add XPath support to HTML::TreeBuilder • River stage three • 56 direct dependents • 200 total dependents • 23 ++

lib/HTML/Robot/Scrapper/Parser/HTML/TreeBuilder/XPath.pm • River stage one • 1 direct dependent • 1 total dependent

WWW::GoKGS::LibXML - HTML::TreeBuilder::LibXML-based WWW::GoKGS • River stage zero • No dependents

Class::XPath - adds xpath matching to object trees • River stage one • 1 direct dependent • 1 total dependent

Web::Scraper - Web Scraping Toolkit using HTML and CSS Selectors or XPath expressions • River stage two • 44 direct dependents • 70 total dependents • 42 ++

HTML::TreeBuilder::LibXML - HTML::TreeBuilder and XPath compatible interface with libxml • River stage two • 8 direct dependents • 19 total dependents • 8 ++

xml_grep - grep XML files looking for specific elements • River stage three • 83 direct dependents • 252 total dependents • 62 ++

WWW::Ruten - Scripting www.ruten.com.tw • River stage zero • No dependents

Web::Query - Yet another scraping library like jQuery • River stage one • 7 direct dependents • 8 total dependents • 30 ++

HTML::Tree::AboutTrees - article on tree-shaped data structures in Perl • River stage three • 171 direct dependents • 972 total dependents • 37 ++

Task::BeLike::LESPEA - Modules that LESPEA uses on a daily basis • River stage zero • No dependents

HTML::Linear - represent HTML::Tree as a flat list • River stage zero • No dependents • 4 ++

XML::Lenient - extracts strings from HTML, XML and similarly tagged text. • River stage zero • No dependents

Task::BeLike::TOKUHIROM - modules I use • River stage zero • No dependents

WWW::Scraper::Lite • River stage zero • No dependents

WWW::Tabela::Fipe - Download the complete FIPE table and stay up to date • River stage zero • No dependents

HTML::Encapsulate - rewrites an HTML page as a self-contained set of files • River stage zero • No dependents

XML::LibXML::jQuery - Fast, jQuery-like DOM manipulation over XML::LibXML • River stage one • 2 direct dependents • 2 total dependents • 2 ++

Text::Corpus::Summaries::Wikipedia - Creates corpora for summarization testing. • River stage zero • No dependents

XML::XPathEngine - a re-usable XPath engine for DOM-like trees • River stage three • 6 direct dependents • 210 total dependents • 4 ++